ASRS-CMFS vs. RoBERTa: Comparing Two Pre-Trained Language Models to Predict Anomalies in Aviation Occurrence Reports with a Low Volume of In-Domain Data Available
نویسندگان
چکیده
We consider the problem of solving Natural Language Understanding (NLU) tasks characterized by domain-specific data. An effective approach consists pre-training Transformer-based language models from scratch using data before fine-tuning them on task at hand. A low volume is problematic in this context, given that performance relies heavily abundance during pre-training. To study problem, we create a benchmark replicating realistic field use to classify aviation occurrences extracted Aviation Safety Reporting System (ASRS) corpus. compare two new benchmark: ASRS-CMFS, compact model inspired RoBERTa, pre-trained only little data, and regular RoBERTa model, with no The benefits its size advantage, while ASRS-CMFS strategy. find compelling statistical evidence outperforms but show more compute-efficient than RoBERTa. suggest good strategy for NLU context scarcity.
منابع مشابه
a synchronic and diachronic approach to the change route of address terms in the two recent centuries of persian language
terms of address as an important linguistics items provide valuable information about the interlocutors, their relationship and their circumstances. this study was done to investigate the change route of persian address terms in the two recent centuries including three historical periods of qajar, pahlavi and after the islamic revolution. data were extracted from a corpus consisting 24 novels w...
15 صفحه اولfocus on communication in iranian high school language classes: a study of the role of teaching materials in changing the focus onto communication in language classes
چکیده ارتباط در کلاس به عوامل زیادی از جمله معلمان، دانش آموزان، برنامه های درسی و از همه مهم تر، مواد آموزشی وابسته است. در تدریس ارتباطی زبان که تاکید زیادی بر توانش ارتباطی دارد، کتاب درسی به عنوان عامل موثر بر پویایی کلاس محسوب میگردد که درس ها را از طریق فراهم آوردن متن ارتباط کلاسی و هم چنین نوع تمرین زبانی که دانش آموزان در طول فعالیت های کلاسی به آن مشغول اند، کنترل می کند. این حقیقت ک...
15 صفحه اولa cross-comparative dtudy between two textbook series in terms of the presentation of politeness
چکیده ندارد.
15 صفحه اولinterpersonal function of language in subtitling
translation as a comunicative process is always said to be associated with various aspects of meaning loss or gain. subtitling as a mode of translating, due to special discoursal and textual conditions imposed upon it, is believed to be an obvious case of this loss or gain. presenting the spoken sound track of a film in writing and synchronizing the perception of this text by the viewers with...
15 صفحه اولcritical period effects in foreign language learning:the influence of maturational state on the acquisition of reading,writing, and grammar in english as a foreign language
since the 1960s the age effects on learning both first and second language have been explored by many linguists and applied linguists (e.g lennerberg, 1967; schachter, 1996; long, 1990) and the existence of critical period for language acquisition was found to be a common ground of all these studies. in spite of some common findings, some issues about the impacts of age on acquiring a second or...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Aerospace
سال: 2022
ISSN: ['2226-4310']
DOI: https://doi.org/10.3390/aerospace9100591